Designing a NooJ Module for Turkish Inflectional Analysis: an Example of Highly Productive Morphology

نویسنده

  • Arianna Bisazza
چکیده

Turkish is a highly inflectional language that represents an interesting challenge to traditional corpus processing techniques. We present here the design of a basic module that allows NooJ users to lemmatize and perform morphological analysis on Turkish texts.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Amazighe Verbal Inflectional Morphology: A New Approach for Analysis and Generation

Amazighe inflectional morphology poses special challenges to Natural Language Processing (NLP) systems. Its rich morphology and the highly complex word formation process of roots and patterns make NLP tools for Amazighe very challenging. In this paper we present an approach for inflectional morphological analysis and generation for Amazighe verbs. The main motivation for this work is to obtain ...

متن کامل

Turkish Language Resources: Morphological Parser, Morphological Disambiguator and Web Corpus

In this paper, we propose a set of language resources for building Turkish language processing applications. Specifically, we present a finite-state implementation of a morphological parser, an averaged perceptron-based morphological disambiguator, and compilation of a web corpus. Turkish is an agglutinative language with a highly productive inflectional and derivational morphology. We present ...

متن کامل

Statistical Morphological Disambiguation for Agglutinative Languages

We present statistical models for morphological disambiguation in agglutinative languages, with a specific application to Turkish. Turkish presents an interesting problem for statistical models as the potential tag set size is very large because of the productive derivational morphology. We propose to handle this by breaking up the morhosyntactic tags into inflectional groups, each of which con...

متن کامل

Statistical Dependency Parsing for Turkish

This paper presents results from the first statistical dependency parser for Turkish. Turkish is a free-constituent order language with complex agglutinative inflectional and derivational morphology and presents interesting challenges for statistical parsing, as in general, dependency relations are between “portions” of words – called inflectional groups. We have explored statistical models tha...

متن کامل

Blasting Open a Choice Space: Learning Inflectional Morphology for NLP

This article discusses the various aspects of designing a system for eliciting knowledge about language from informants. For each design aspect, various options for implementation are presented, along with their pros, cons, and repercussions for other parts of the knowledge elicitation system. A running example throughout the text is taken from the paradigmatic morphology elicitation module of ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009